Meta-learning for Indian languages: Performance analysis and improvements with linguistic similarity measures
نویسندگان
چکیده
Indian languages share a lot of overlap in acoustic and linguistic content. Though different use writing systems, the phoneme sets logically overlap. Most these are low-resourced, lacking enough annotated speech data to build good automatic recognition (ASR) systems. Recently proposed model-agnostic meta-learning (MAML) algorithm has shown great success fast adaptation multilingual models unseen datasets. In this work, we establish usefulness MAML pretraining quickly building reasonably ASRs for low-resource languages. significantly outperforms joint training its capability few-shot learning faster adaptation. On average, yields absolute improvements 5.4% CER 20.3% WER over fast-adaptation setting with five epoch fine-tuning. Further, exploit similarities source transcriptions target through loss-weighing scheme during improve performance models. Similarity-based loss-weighings yield 0.2% 1% on average.
منابع مشابه
New distance and similarity measures for hesitant fuzzy soft sets
The hesitant fuzzy soft set (HFSS), as a combination of hesitant fuzzy and soft sets, is regarded as a useful tool for dealing with the uncertainty and ambiguity of real-world problems. In HFSSs, each element is defined in terms of several parameters with arbitrary membership degrees. In addition, distance and similarity measures are considered as the important tools in different areas such as ...
متن کاملEvaluation of Similarity Measures for Template Matching
Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...
متن کاملSOME SIMILARITY MEASURES FOR PICTURE FUZZY SETS AND THEIR APPLICATIONS
In this work, we shall present some novel process to measure the similarity between picture fuzzy sets. Firstly, we adopt the concept of intuitionistic fuzzy sets, interval-valued intuitionistic fuzzy sets and picture fuzzy sets. Secondly, we develop some similarity measures between picture fuzzy sets, such as, cosine similarity measure, weighted cosine similarity measure, set-theoretic similar...
متن کاملWord Similarity Datasets for Indian Languages: Annotation and Baseline Systems
With the advent of word representations, word similarity tasks are becoming increasing popular as an evaluation metric for the quality of the representations. In this paper, we present manually annotated monolingual word similarity datasets of six Indian languages – Urdu, Telugu, Marathi, Punjabi, Tamil and Gujarati. These languages are most spoken Indian languages worldwide after Hindi and Ben...
متن کاملUsing Meta-Languages for Learning
This paper proposes the use of metalanguages to provide an eeective representation scheme for Machine Learning. This proposal is supported by both: { a demonstration of the beneets that metalanguages can bring to Inductive Logic Programming , and { the successful and widespread use of metalanguages in other search intensive areas of Artiicial Intelligence, such as planning.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3300790